Search Result

Select

Survey of large-scale resource description framework data partitioning methods in distributed environment

YANG Cheng, LU Jiamin, FENG Jun

Journal of Computer Applications 2020, 40 (11): 3184-3191. DOI: 10.11772/j.issn.1001-9081.2020040539

Abstract （410）

PDF （623KB）（431）

Save

With the rapid development of knowledge graph and its wide usage in various vertical domains, the requirements for efficient processing of Resource Description Framework (RDF) data has increasingly become a new topic in the field of modern big data management. RDF is a data model proposed by W3C to describe knowledge graph entities and inter-entity relationships. In order to effectively cope with the storage and query of the large-scale RDF data, many scholars consider managing RDF data in a distributed environment. The key problem faced by the distributed storage of RDF data is data partitioning, and the performance of Simple Protocol and RDF Query Language (SPARQL) queries is largely determined by the results of partitioning. From the perspective of data partitioning, two types:graph structure-based RDF data partitioning methods and semantics-based RDF data partitioning methods, were mainly focused on and described in depth. The former include multi-granularity hierarchical partitioning, template partitioning and clustering partitioning, and are suitable for the wide semantic categories scenes of general domain query, while the latter include hash partitioning, vertical partitioning and pattern partitioning, and are more suitable for the environments of the relatively fixed semantic categories of vertical domain query. In addition, several typical partitioning methods were compared and analyzed to provide enlightenment for the future research on RDF data partitioning methods. Finally, the future research directions of RDF data partitioning methods were summarized.

Reference | Related Articles | Metrics

Select

Spatio-temporal index method for moving objects in road network based on HBase

FENG Jun, LI Dingsheng, LU Jiamin, ZHANG Lixia

Journal of Computer Applications 2018, 38 (6): 1575-1583. DOI: 10.11772/j.issn.1001-9081.2017122977

Abstract （509）

PDF （1599KB）（354）

Save

Hbase can only use key value query, it is not suitable for multidimensional query of mobile objects in road network, which leads to inefficiency in storing index and query. In order to solve this problem, an efficient HBase indexing framework for Road network Moving objects (RM-HBase) was designed and implemented on the basis of HBase storage structure. Firstly, the upper Hmaster and lower HregionServer of the primary HBase index structure were improved to solve the hot distribution problem of distributed cluster data and improve the query efficiency of spatial data. Secondly, the road network moving object index - Road Network tree (RN-tree) was proposed to solve the problem of "dead space" in space division and improve the query efficiency of road sections in the space at the same time. Then, based on the above improvements of HBase index, the query algorithms for spatio-temporal range query, spatial-temporal K Nearest Neighbor (KNN) query and moving object trajectory query were designed respectively. Finally, the Spatial-TEmporal HBase IndeX (STEHIX) framework based on HBase distributed database was selected as the contrast object, the performance of RM-HBase was respectively analyzed from two aspects of the performance of index framework and the efficiency of query algorithm. The experimental results show that, the proposed RM-HBase is superior to the STEHIX framework in both the performance of data equilibrium distribution and the query performance of spatio-temporal query algorithm, and it is helpful to promote the efficiency of spatial-temporal index for the moving object data in mass road network.

Reference | Related Articles | Metrics

Select

Software testing data generation technology based on software hierarchical model

XU Weishan, YU Lei, FENG Junchi, HOU Shaofan

Journal of Computer Applications 2016, 36 (12): 3454-3460. DOI: 10.11772/j.issn.1001-9081.2016.12.3454

Abstract （695）

PDF （1080KB）（415）

Save

Since Markov chain model based software testing does not consider the software structural information and has low ability of path coverage and fault detection, a new software testing model called software hierarchical testing model was proposed based on the combination of statistical testing and Markov chain model based testing. The software hierarchical testing model contains the interaction between software and external environment, and also describes the internal structural information of software. Besides, the algorithm for generating test data set was put forward:firstly, the test sequences conforming to the actual usage of software were generated; then the input data which covered software internal structure was generated for the test sequences. Finally, in the comparison experiments with software testing based on Markov chain, the new model satisfies the software testing sufficiency and improves the test data set's ability of path coverage and fault detection.

Reference | Related Articles | Metrics

Select

Single sample face recognition based on orthogonal gradient binary pattern

YANG Huixian CAI Yongyong ZHAi Yunlong LI Qiuqiu FENG Junpeng

Journal of Computer Applications 2014, 34 (2): 546-549.

Abstract （490）

PDF （590KB）（486）

Save

To overcome the limitations of traditional face recognition methods for single sample, an improved gradient face algorithm named Orthogonal Gradient Binary Pattern (OGBP), which is robust to variations of illumination, face expression and posture, was proposed. Firstly, the features of the image samples were extracted by orthogonal gradient binary pattern. Then the feature vectors of each direction were concatenated into the general feature vector for face recognition. Finally the Principle Component Analysis (PCA) method was used to reduce dimensions and the nearest neighbor classifier was used for face image classification and recognition. Experimental results on YALE and AR face database indicate that the proposed method is simple, effective and better than the original gradient face algorithm, and also has better performance in face description for single sample.